Fast protein binding site comparisons using visual words representation
نویسندگان
چکیده
MOTIVATION Finding geometrically similar protein binding sites is crucial for understanding protein functions and can provide valuable information for protein-protein docking and drug discovery. As the number of known protein-protein interaction structures has dramatically increased, a high-throughput and accurate protein binding site comparison method is essential. Traditional alignment-based methods can provide accurate correspondence between the binding sites but are computationally expensive. RESULTS In this article, we present a novel method for the comparisons of protein binding sites using a 'visual words' representation (PBSword). We first extract geometric features of binding site surfaces and build a vocabulary of visual words by clustering a large set of feature descriptors. We then describe a binding site surface with a high-dimensional vector that encodes the frequency of visual words, enhanced by the spatial relationships among them. Finally, we measure the similarity of binding sites by utilizing metric space operations, which provide speedy comparisons between protein binding sites. Our experimental results show that PBSword achieves a comparable classification accuracy to an alignment-based method and improves accuracy of a feature-based method by 36% on a non-redundant dataset. PBSword also exhibits a significant efficiency improvement over an alignment-based method.
منابع مشابه
PBSword: a web server for searching similar protein–protein binding sites
PBSword is a web server designed for efficient and accurate comparisons and searches of geometrically similar protein-protein binding sites from a large-scale database. The basic idea of PBSword is that each protein binding site is first represented by a high-dimensional vector of 'visual words', which characterizes both the global and local shape features of the binding site. It then uses a sc...
متن کاملGenProBiS: web server for mapping of sequence variants to protein binding sites
Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion bindi...
متن کاملDirected Blocking of TGF-β Receptor I Binding Site Using Tailored Peptide Segments to Inhibit its Signaling Pathway
Background: TGF-β isoforms play crucial roles in diverse cellular processes. Therefore, targeting and inhibiting TGF-β signaling pathway provides a potential therapeutic opportunity. TGF-β isoforms bind and bring the receptors (TβRII and TβRI) together to form a signaling complex in an ordered manner. Objectives: Herein, an antagonistic variant of TGF-β (AnTβ)...
متن کاملInfluence of time-restricted feeding schedule on daily rhythm of abcb1a gene expression and its function in rat intestine
214 words Introduction; 488 words Discussion; 931 words d) Abbreviations: ADx, adrenalectomized; DBP, D-site biding protein; E4BP4, E4 promoter binding protein-4; HLF, hepatic leukemia factor; P-gp, P-glycoprotein; ZT, zeitgeber time e) Recommended section: Metabolism, Transport and Pharmacogenomics This article has not been copyedited and formatted. The final version may differ from this versi...
متن کاملPalarimetric Synthetic Aperture Radar Image Classification using Bag of Visual Words Algorithm
Land cover is defined as the physical material of the surface of the earth, including different vegetation covers, bare soil, water surface, various urban areas, etc. Land cover and its changes are very important and influential on the Earth and life of living organisms, especially human beings. Land cover change monitoring is important for protecting the ecosystem, forests, farmland, open spac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 28 10 شماره
صفحات -
تاریخ انتشار 2012